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Abstract 



O . These lectures provide an informal introduction into the notions and tools used to analyze 



statistical properties of eigenvalues of large random Hermitian matrices. After developing the 
general machinery of orthogonal polynomial method, we study in most detail Gaussian Unitary 
■ Ensemble (GUE) as a paradigmatic example. In particular, we discuss Plancherel-Rotach 

asymptotics of Hermite polynomials in various regimes and employ it in spectral analysis 
of the GUE. In the last part of the course we discuss general relations between orthogonal 
^ ' polynomials and characteristic polynomials of random matrices which is an active area of 

' current research. 

o 

C^l ■ 1 Preface 



Gaussian Ensembles of random Hermitian or real symmetric matrices always played a prominent 
role in the development and applications of Random Matrix Theory. Gaussian Ensembles are 
' uniquely singled out by the fact that they belong both to the family of invariant ensembles, and 

to the family of ensembles with independent, identically distributed (i.i.d) entries. In general, 
<^ ' mathematical methods used to treat those two families are very different. 

"j^ , In fact, all random matrix techniques and ideas can be most clearly and consistently introduced 

using Gaussian case as a paradigmatic example. In the present set of lectures we mainly concentrate 
on consequences of the invariance of the corresponding probability density function, leaving aside 
^ ■ methods of exploiting statistical independence of matrix entries. Under these circumstances the 

^> , method of orthogonal polynomials is the most adequate one, and for the Gaussian case the relevant 

^ • polynomials are Hermite polynomials. Being mostly interested in the limit of large matrix sizes 

we will spend a considerable amount of time investigating various asymptotic regimes of Hermite 
polynomials, since the latter are main building blocks of various correlation functions of interest. 
In the last part of our lecture course we will discuss why statistics of characteristic polynomials 
of random Hermitian matrices turns out to be interesting and informative to investigate, and will 
make a contact with recent results in the domain. 

The presentation is quite informal in the sense that I will not try to prove various statements 
in full rigor or generality. I rather attempt outlining the main concepts, ideas and techniques 
preferring a good illuminating example to a general proof. A much more rigorous and detailed 
exposition can be found in the cited literature. I will also frequently employ the symbol oc. In the 
present set of lectures it always means that the expression following oc contains a multiplicative 
constant which is of secondary importance for our goals and can be restored when necessary. 

2 Introduction 

In these lectures we use the symbol to denote matrix or vector transposition and the asterisk * 
to denote Hermitian conjugation. In the present section the bar z denotes complex conjugation. 
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Let us start with a square complex matrix Z of dimensions N x N, with complex entries 
Zij = Xij + iUij, 1 < i,j < N. Every such matrix can be conveniently looked at as a point in a 
2iV^-dimensional Euclidean space with real Cartesian coordinates Xij , Uij , and the length element 
in this space is defined in a standard way as: 

{dsf = Tr [dZdZ*) = dzijd^ = [idx)l + idy)^] . (1) 

ij ij 

As is well-known (see e.g.[l]) any surface embedded in an Euclidean space inherits a natural 
Riemannian metric from the underlying Euclidean structure. Namely, let the coordinates in a 
n— dimensional Euclidean space be {xi,...,Xn), and let a fc— dimensional surface embedded in 
this space be parameterized in terms of coordinates {qi,. . . , Qk), k < n as Xi = Xi{q-i, . . . , Qk), i = 
1 , . . .n. Then the Riemannian metric Qrni 

= gim on the surface is defined from the EucUdean length 

element according to 

n n / k Q k 

{dsf = ^{dxif = X! X! -K-^dqm = ^ Qmndqmdqi. (2) 

z— 1 i—l \m—l / m,/— 1 

Moreover, such a Riemannian metric induces the corresponding integration measure on the surface, 
with the volume element given by 

c^M = VWldli ■ ■ ■ dqk, 9 = det (5mOf,m=i- (^) 

For k = n these are just the familiar formulae for the lengths and volume associated with 
change of coordinates in an Euclidean space. For example, for n = 2 we can pass from Cartesian 
coordinates — oo < a;, y < oo to polar coordinates r > 0, < ^ < 27r by a; = rcos^, y = rsin^, so 
that dx = d,r cos — r sin OdO, dy = dr sin + r cos OdO, and the Riemannian metric is defined by 
(dsf = {dxf + {dyf = {drf + r'^{d9f. We find that gn = 1, 512 = 321 = 0, 522 = r'^, and the 
volume clement of the integration measure in the new coordinates is d/i = rdrdO; as it should be. 
As the simplest example of a "surface" with k < n = 2 embedded in such a two-dimensional space 
we consider a circle r — R ~ const. We immediately see that the length element (ds)^ restricted 
to this "surface" is {ds)"^ ~ R'^{dd)^, so that gn = R^, and the integration measure induced on the 
surface is correspondingly dfi — RdO. The "surface" integration then gives the total "volume" of 
the embedded surface (i.e. circle length 2'kR). 




Figure 1: The spherical coordinates for a two dimensional sphere in the three-dimensional Euclidean 
space. 
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Similarly, we can consider a two-dimensional (fc = 2) sphere i?^ = -\- y'^ + embedded in 
a three-dimensional Euclidean space (n = 3) with coordinates x, y, z and length element (rfs)^ = 
(dx)"^ + {dyY + {dzY . A natural parameterization of the points on the sphere is possible in terms 
of the spherical coordinates (j),9 (see Fig. 1) 

X = RsuyO coscj), y = RsinOsincj), z = Rcos9; < < n, < <j) < 27r, 

which results in (ds)^ = R^{d9)^ + R^ svc? 9{d(l)f' . Hence the matrix elements of the metric are 
gii = R^, gi2 = §21 = 0, <?22 = R? sin^ 6*, and the corresponding "volume element" on the sphere is 
the familiar elementary area d\i = R^ sin 9d9d(j). 

As a less trivial example to be used later on consider a 2— dimensional manifold formed by 2 x 2 
unitary matrices U embedded in the 8 dimensional Euclidean space of Gl{2;C) matrices. Every 
such matrix can be represented as the product of a matrix tJc from the coset space U{2) /U (1) x U{1) 
parameterized by fc = 2 real coordinates Q < 4> < 27r, < ^ < 7r/2, and a diagonal unitary matrix 
Ud, that \sU = UJJc^ where 

J _ ( C0&9 -sin6'e-*'^ \ rj _ ( e'"*'^ \ , , 

^^-^.sin^e^^ cos^ J' e^'t'- ) ' 

Then the differential dU of the matrix U = UdUc has the following form: 

^Tj - ( -[d9ain9 + icos9d4)i\e-"l'^ e-*('^i+'^)[-d6'cos6' + ^(#1 + sin6'] \ , . 
V e'('^+"^=) [rf6l cos 9 + i{d(l) + #2) sin 61] [-d9 sin 9 + i#2 cos 9]e''t>-^ J ' 

which yields the length element and the induced Riemannian metric: 

{dsf = Tr (dUdU*^ = 2{d9f + (#1)^ + (#2)^ + 2sin2 9{d(t)f + 2sm^ 9{d(l)d(l)i + ##2). (6) 

We sec that the nonzero entries of the Riemannian metric tensor gmn in this case are 51 1 = 
2, 522 = 533 = 1, 544 = 2 sin^ 9, 523 = 524 = 532 = 534 = sin^ 9, so that the determinant det [gmn] = 
4sin^ ^cos^ 9. Finally, the induced integration measure on the group U{2) is given by 

dii{U) = 2 sin 9 cos 9 d9 dcp dcpi #2 • (7) 

It is immediately clear that the above expression is invariant, by construction, with respect to 
multiplications U — > VU, for any fixed unitary matrix V from the same group. Therefore, Eq.(7) 
is just the Haar measure on the group. 

We will make use of these ideas several times in our lectures. Let us now concentrate on the 
AT^— dimensional subspace of Hermitian matrices in the 2N^— dimensional space of all complex 
matrices of a given size A'^. The Hermiticity condition H = H* = H'^ amounts to imposing the 
following restrictions on the coordinates: xij = Xji, yij = —yji- Such a restriction from the space 
of general complex matrices results in the length and volume element on the subspace of Hermitian 
matrices: 

{dsf = Tr (dHdH*j = Y^idxiif + 2 ^ [{dxi.f + {dyi.f] (8) 

i i<j 

W(JV-l) -| — |- ~i — r 

dfi{H) = 2 4 ||da;ii||(ia;ij(ij/y. (9) 

i i<j 

Obviously, the length clement (d.s)^ = TidHdH* is invariant with respect to an automorphism (a 
mapping of the space of Hermitian matrices to itself) by a similarity transformation H —> U~^HU, 
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where U G U{N) is any given unitary N x N matrix: U* = U~^. Therefore the corresponding 
integration measure d^{H) is also invariant with respect to all such "rotations of the basis". 

The above-given measure d^{H) written in the coordinates xa, Xi^j, yi<^j is frequently referred 
to as the "flat measure" . Let us discuss now another, very important coordinate system in the space 
of Hermitian matrices which will be in the heart of all subsequent discussions. As is well-known, 
every Hermitian matrix H can be represented as 

H = UkU-^, A = diag(Ai,...,Ajv), = (10) 

where real — oo < Afe < oo, k = 1, . . . ,N are eigenvalues of the Hermitian matrix, and rows of the 
unitary matrix U are corresponding eigenvectors. Generically, we can consider all eigenvalues to 
be simple (non-degenerate). More precisely, the set of matrices H with non-degenerate eigenvalues 
is dense and open in the A/'^-dimensional space of all Hermitian matrices, and has full measure 
(see [3], p. 94 for a formal proof). The correspondence H ^ {u & U{N),k^ is, however, not 

one-to-one, namely UiAU^^ = 1)2^)2'^ if f/f^[/2 = diag (e*"^! , . . . , e^"^" ) for any choice of the 
phases 4>i, . . . ,(j)N- To make the correspondence one-to-one we therefore have to restrict the unitary 
matrices to the coset space U{N)/U{l)i^. . .(8)C/(1), and also to order the eigenvalues, e.g. requiring 
Ai < A2 < . . . < Xn- Our next task is to write the integration measure dn{H) in terms of 
eigenvalues A and matrices U. To this end, we differentiate the spectral decomposition H = UALf*, 

and further exploit: d (u*U^ = dU*U + U*dU = 0. This leads to 



dH = U 



dA + U*dUA - AU*dU 



U*. (11) 



Substituting this expression into the length element (ds)^, see Eq.(8), and using the short- hand 
notation 5U for the matrix U*dU satisfying anti-Hermiticity condition dU* = —SU, we arrive at: 



{dsf = Tr 



(d^y + 2dA (^5UA - A6U^ + (5/7 a)V (A(5[/)^ - 25UA?5U 



(12) 



Taking into account that A is purely diagonal, and therefore the diagonal entries of the commutator 
{stlA — A51J^ are zero, we see that the second term in Eq.(12) vanishes. On the other hand, the 
third and subsequent terms when added up are equal to 



2Tr 



SUASUA - dV^A} 



= 2^ [5Ui,X,5U,iX, - Xj6U,j6Uj^] = - ^ (A, - A,)' 5Uj,6Uij , 



which together with the first term yields the final expression for the length element in the "spectral" 
coordinates 

{dsf = {dXif + ^ (Ai - XjfJU~5U,, (13) 

i i<j 



where we exploited the anti-Hermiticity condition —6Uji = 6Uij. Introducing the real and imag- 
inary parts SUij = Spij + iSqij as independent coordinates we can calculate the corresponding 
integration measure djj{H) according to the rule in Eq.(3), to see that it is given by 

dfi{H) = l[iXi-Xjfl[dXiXdMiU). (14) 

i<j i 
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The last factor dM{U) stands for the part of the measure depending only on the [/—variables. A 
more detailed consideration shows that, in fact, dM{U) = d^,{U), which means that it is given 
(up to a constant factor) by the invariant Haar measure on the unitary group U{N). This fact is 
however of secondary importance for the goals of the present lecture. 

Having an integration measure at our disposal, we can introduce a probability density function 
(p.d.f.) 'P{H) on the space of Hcrmitian matrices, so that V {H)dii{H) is the probability that 
a matrix H belongs to the volume element dii{H). Then it seems natural to require for such a 

probability to be invariant with respect to all the above automorphisms, i.e. ViH) = V {lJ*HU^. 

It is easy to understand that this "postulate of invariance" results in V being a function of N first 

traces Tr_ff", n = 1. . . . ,N (the knowledge of first traces fixes the coefficients of the characteristic 
polynomial of H uniquely, and hence the eigenvalues. Therefore traces of higher order can always 
be expressed in terms of the lower ones). Of particular interest is the case 

V{H) = Cexp-TrQ{H), Q{x) = a2jx'^^ + . . . + ao, (15) 

where 2j < N, the parameters a2i and C are real constants, and a2j > 0. Observe that if we take 

Q{x) = ax^ +bx + c, (16) 
then e"^'"'^*^^) takes the form of the product 

JV 

e-«[E,^«+2E,<,(4+4)]e-''E.^«e-^^ = 6"=^ J] (e""^'--''"") [] e'^^ [] e-2<. (17) 

i=l i<j i<j 

We therefore sec that the probability distribution of the matrix H can be represented as a product 
of factors, each factor being a suitable Gaussian distribution depending only on one variable in the 
set of all coordinates xu, Xi^j, yi<,j- Since the same factorization is valid also for the integration 
measure d^{H), see Eq.(9), we conclude that all these N"^ variables are statistically independent 
and Gaussian-distributed. 

A much less obvious statement is that if we impose simultaneously two requirements: 

• The probability density function V{H) is invariant with respect to all conjugations H 
H' = JJ-^HU by unitary matrices (j, that is 'P(ff') = ViH); and 

• the N'^ variables xu, Xi<j, yi^j are statistically independent, i.e. 

N N 
i=l i<j 

then the function ViH) is necessarily of the form ViH) = Ce"(''^'^"^+''^^"+'''^\ for some con- 
stants a > 0, 6, c. The proof for any N can be found in [2], and here we just illustrate its main 
ideas for the simplest, yet nontrivial case N = 2. We require invariance of the distribution with 
respect to the conjugation oi H by U G U{2), and first consider a particular choice of the unitary 

matrix J7 = ( ^ ^ ] corresponding to ^ = = ^2 = 0, and small values -C 1 in Eq.(4). In 



this approximation the condition H' = U ^HU amounts to 

( x'll ^12+W'l2\^f Xii+26'xi2 X12 + iyi2 + {X22 - Xii) \ , . 

\x[2-iy'l2 X'22 ) \ XX2- iy\2^9{x22- Xx\) X22-2Qxx2 J 
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where we kept only terms linear in 9. With the same precision we expand the factors in Eq.(18); 



1 + 20x12 



1 d/i 



/i dx 
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, h{X22) = f2{X22) 



26x1 



1 df2 



Al\x'i2) = f^2'M 



1 + 6{X22 -Xu) 



h dX22 



The requirements of statistical independence and invariance amount to the product of the left-hand 
sides of the above expressions to be equal to the product of the right-hand sides, for any 6. This 
is possible only if: 

d In /i d In /2 



2x 



12 



dxu 



dX2 



+ {X22 - Xii) 



dXi2 



which can be further rewritten as 



1 



{X22 - Xu) 



dlnfi _ din /a 

dX22 



dxi 



= const = 



1 dln/f) 



'12 



2xi2 dx 



12 



(20) 



(21) 



where we used that the two sides in the equation above depend on essentially different sets of 
variables. Denoting consti = —2a, we see immediately that 



/}^'(:ri2)cxe-2--, 



and further notice that 



dx 



+ 2axii = const2 
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dlnf2 
dX' 



+ 2aa;22 



22 



by the same reasoning. Denoting const2 = — 6, we find: 



fi{xii) (X e 



-axii — bxii 



f2{x22) cx e~ 



(22) 



and thus we are able to reproduce the first two factors in Eq.(17). To reproduce the remaining 
factors we consider the conjugation by the unitary matrix Uci = i ^ J^*^ ^ ^ . ), which cor- 

y 1 I %0i f 

responds to the choice ^ = 0, = 02 = — cc = in Eq.(4), and again we keep only terms linear 
in the small parameter a <C 1. Within such a precision the transformation leaves the diagonal 
entries xu, X22 unchanged, whereas the real and imaginary parts of the off-diagonal entries are 
transformed as 

a;'i2 = ^12 - 2ayi2, 1/12 = J/i2 + 2aa;i2. 

In this case the invariance of the p.d.f. 'P{H) together with the statistical independence of the 
entries amount, after straightforward manipulations, to the condition 



1 rfln/}^) 



1 d\nf[^ 



X12 dxi2 yi2 dyi2 
which together with the previously found /i2^(a:;i2) yields 



/i2^(2/i2) oc e" 



-2ai/i2 
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completing the proof of Eq.(17). 

The Gaussian form of the probability density function, Eq.(17), can also be found as a result of 

rather different lines of thought. For example, one may invoke an information theory approach a la 
Shanon-Khinchin and define the amount of information I\P{H)] associated with any probability 
density function 'P{li) by 

I[V{H)] = - j dn{H) V{H) \nV{H) (23) 

This is a natural extension of the corresponding definition 2\pi, . . . ,Pm] = — X^^iPm Inpm for 
discrete events 1, m. 

Now one can argue that in order to have matrices H as random as possible one has to find 
the p.d.f. minimizing the information associated with it for a certain class of 'P{H) satisfying 
some conditions. The conditions usually have a form of constraints ensuring that the probability 
density function has desirable properties. Let us, for example, impose the only requirement that 
the ensemble average for the two lowest traces TrH, TiH^ must be equal to certain prescribed 
values, say E 



TiH 



= b and E 



TrH^ 



= a > 0, where the £^ [■ • ■] stand for the expectation value 

with respect to the p.d.f. P{H). Incorporating these constraints into the minimization procedure 
in a form of Lagrange multipliers fi, 2^2, we seek to minimize the functional 

I[V{H)] =- J dn{H) V{H) {inP(fl') - v^TyiH - z/aTrff^j . (24) 

The variation of such a functional with respect to SP{H) results in 

SI[P{H)] = - j dn{H) 8V{H) {1 + InP(fl') - v{Vr:H - z/aTr^^} = (25) 

possible only if 

V{H) cx exp{j/iTriJ + V2Tt:H^} 

again giving the Gaussian form of the p.d.f. The values of the Lagrange multipliers are then 
uniquely fixed by constants a, h, and the normalization condition on the probability density func- 
tion. For more detailed discussion, and for further reference see [2], p. 68. 

Finally, let us discuss yet another construction allowing one to arrive at the Gaussian Ensembles 
exploiting the idea of Brownian motion. To start with, consider a system whose state at time t 
is described by one real variable x, evolving in time according to the simplest linear differential 
equation = —x describing a simple exponential relaxation x{t) = xoe~* towards the stable 
equilibrium x = 0. Suppose now that the system is subject to a random additive Gaussian white 
noise ^{t) function of intensity D ^, so that the corresponding equation acquires the form 

^^x = -x + m, E^[^{ti)^{h)]=D6{h-t2), (26) 



^The following informal but instructive definition of the white noise process may be helpful for those not very 
familiar with theory of stochastic processes. For any positive t > and integer k > 1 define the random function 
?fc(*) = V 2/"" z_,j^—Q cosnt, where real coefficients a„ are all independent, Gaussian distributed with zero mean 

E[a„] = and variances -Efag] = D/2 and = D for 1 < n < k. Then one can, in a certain sense, consider 

white noise as the limit of ffe(t) for k — > oo. In particular, the Dirac S{t — t') is approximated by the limiting value 

f Bin p+l/2)(f-f')] 
27r sin(t-t')/2 
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where [. . .] stands for the expectation value with respect to the random noise. The main char- 
acteristic property of a Gaussian white noise process is the following identity: 



exp ■ 



expj^^ v''it)dt^ 



(27) 



valid for any (smooth enough) test function v{t). This is just a direct generalization of the standard 
Gaussian integral identity: 



I It: a 



e 2 . 



(28) 



valid for Rea > 0, and any (also complex) parameter h. 

For any given realization of the Gaussian random process ^(f) the solution of the stochastic 
differential equation Eq.(26) is obviously given by 



x(t) 



xo+ [ e^^{T)dT 
Jo 



(29) 



This is a random function, and our main goal is to find the probability density function ■p(i, x) for 
the variable x{t) to take value x at any given moment in time t, if we know surely that x(0) = xq- 
This p.d.f. can be easily found from the characteristic function 



T{t,q)=E^ 



-iqx{t) 



■ exp < —zqxoe 



(30) 



obtained by using Eqs. (27) and (29). The p.d.f. is immediately recovered by employing the 
inverse Fourier transform: 



r{t,x) = p ^e^^^^i 



-iqxit) 



^T^D{1 - e-2*) 



exp • 



■ D{1 - e-2*) 



(31) 



The formula Eq.(31) is called the Ornstein-Uhlenbeck (OU) probability density function, and 
the function x{t) satisfying the equation Eq.(26) is known as the 0-U process. In fact, such a 
process describes an interplay between the random "kicks" forcing the system to perform a kind of 
Brownian motion and the relaxation towards x = 0. It is easy to see that when time grows the OU 
p.d.f. "forgets" about the initial state and tends to a stationary (i.e. time-independent) universal 
Gaussian distribution: 



V^t oo, x) 



(32) 



Coming back to our main topic, let us consider N'^ independent OU processes: N of them 
denoted as 

:Xi = -Xi + ^i{t), l<i<N (33) 



dt 



and the rest A''(A'' — 1) given by 

d 



Xij = -Xij + {t) , ^Vij = -Vij + ^Ij' {t) 



(2)/ 



dt 



df 



(34) 
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1 r 1 

Tit, H) oc Const x — exp \ -— --Tr (h - iIoe"*V 



(36) 



where the indices satisfy 1 < i < j < N. Stochastic processes £,{t) in the above equations 
are taken to be all mutually independent Gaussian white noises characterized by the correlation 
functions: 

Ei Kn(ti)^i.(i2)] = 2DSi,,iJit, - t2), [Q{ti)ek!{t2)] = D6^,,,Ji,kSj,iS{ti ' h)- (35) 

As initial values Xi{0), Xij (0), i/ij (0) for each OU process wc choose diagonal and ofF-diagonal entries 
ijff-*, i = 1, . . . ,iV and RcH^^\, lmH^^\ of a fixed N x N Hcrmitian matrix iJ^"^ Let us now 
consider the Hcrmitian matrix H{t) whose entries arc Huit) = Xi{t), Hi^j{t) = Xi<^j(t) + iyi^j{t) 
for any t>Q. It is immediately clear that the joint p.d.f. V {t, of the entries of such a matrix 
H{t) will be given for any t > by the OU-type formula: 

, Pvn } i Tr ( H - ^ 

V(l - e-2*)^ 

In the limit t ^ oo this p.d.f. converges to a stationary, independent expression 

V{t,H)o:Ce-^^"^ (37) 

independent of the initial matrix Hq. We see therefore that the familiar Gaussian ensemble in 
the space of Hermitian matrices arises as the result of the stochastic relaxation from any initial 
condition, in particular, from any diagonal matrix with uncorrelated entries. In the next step one 
may try to deduce the stochastic dynamics of the eigenvalues of the corresponding matrices. Those 
eigenvalues obviously evolve from completely uncorrelated to highly correlated patterns. This very 
interesting set of question goes beyond our present goals and we refer to [2] for an introduction 
into the problem. 

After specifying the probability density function 'P{H) the main question of interest is to 
characterize the statistical properties of the sequence of eigenvalues Ai, . . . , Ajv of If. A convenient 
way of doing this is to start with the joint p.d.f. of all these eigenvalues. Because of the "rotational 
invariance" assumption the function V{H) depends in fact only on the eigenvalues, for example for 

the "symmetric" Gaussian case 6 = wc have T{H) cx e ^i=i » . Moreover, wc have seen that 
the integration measure dfj,{H) when expressed in terms of eigenvalues and eigenvectors effectively 
factorizes, see Eq.(14). Collecting all these facts we arrive at the conclusion, that the relevant joint 
p.d.f of all eigenvalues can be always written, up to a normalization constant, as 

P(Ai,...,Ajv)rfAi...rfAjv oce"Sti'3(^*) JJ(Ai- Aj)^ rfAi...rfAjv (38) 

i<j 

for a general, non-gaussian weight e^'^^'^'^^. Wc immediately sec that the presence of the "Ja- 
cobian factor" Yli^j {K — A^)^ is responsible of the fact that the eigenvalues are correlated in a 
non-trivial way. In what follows we are going to disregard the fact that eigenvalues Aj were initially 
put in increasing order. More precisely, for any symmetric function f oi N real variables Ai, . . . , Ajv 
the expected value will be calculated as 

/ /(Ai, . . . , Ajv)7'(Ai, . . . , Ajv)dAi . . . rfAjv. 

Indeed, with p.d.f. being symmetric with respect to permutations of the eigenvalue set, disregard- 
ing the ordering amounts to a simple multiplicative combinatorial factor n! in the normalization 
constant. 
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Our main goal is to extract the information about these correlations in the limit of large size 
N. Prom this point of view it is pertinent to discuss a few quantitative measures frequently used 
to characterize correlations in sequences of real numbers. 



3 Characterization of Spectral Sequences 

Let — oo < Ai, A2, . . . , Aat < 00 be the positions of N points on the real axis, characterized by the 
joint probability density function (JPDF) 

P(Ai, A2, . . . , Aat) dXi ... dXN 

of having, regardless of labelling, one point in the interval [Ai, Ai + dXi], another in the interval 
[A2,A2 + (iA2],..., another in [A^v, Ajv + c^Ajv]. Since in this section we deal exclusively with real 
variables, the bar will stand for the expectation value with respect to such a JPDF. 

The statistical properties of the sequence {Aj} are conveniently characterized by the set of 
n— point correlation functions, defined as 



7?.„(Ai, A2, . . . , A„) = — / ^(Ai, A2, . . . , Ajv) dXn+i ■ ■ ■ rfAjv- 

(N -n)\ J 



(39) 



It is obvious from this definition that the lower correlation functions can be obtained from the 
higher-order ones: 



TZn{Xi, X2, . . . ,Xn) = j-rz r / 7?.„+i(Ai, A2, . . . , A„+i) rfA„+i. (40) 

(N -n) J 

To provide a more clear interpretation of these correlation functions we relate them to the 
statistics of the number Nb of points of the sequence {Aj} within any set B of the real axis (e.g an 
interval [a, 5]). Let Xb{x) be the characteristic function of the set B, equal to unity if x G i? and zero 
otherwise. Introduce the exact density function pn{X) of the points {Aj} aroimd the point A on the 
real axis. It can be conveniently written using the Dirac's 5— function as pn{X) = J2f=i ^(^ ~ ^i)- 
Then Nb= J XB{X)pN{X)dX. 

On the other hand, consider 

J 7^l(Al)rfAl = J XB{Xi)ni{Xi)dXi = N J xb{Xi)V{Xi,X2,...,Xn) dXi . . . Xn 

N 

= / ^AB(Ai)7'(Ai,A2,...,Ajv) (iAi ... Ajv (41) 

i=l 

and therefore 

/ TZi{Xi)dXi = Nb = expectation of the number of points in B. (42) 
Jb 

Similarly, consider 

J XB{Xt)xB{X2)Tl2{XuX2)dXidX2 = N{N - 1) y Xb(Ai)xs(A2)7'(Ai, A2, . . . ,Xn) dXi . . . Xn 

N 

= J Y.XB{^i)XB{Xj)V{Xi,X2,...,XN) dXi ... Xn, (43) 
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which can be interpreted as 

/ 7?.2(Ai, A2)rfAidA2 = expectation of the number of pairs of points in B (44) 

where if, say, Ai and A2 are in B, then the pair {1, 2} and {2, 1} are both counted. 

To relate the two-point correlation function to the variance of Nb we notice that in view of 
Eq.(42) the one-point correlation function 7?.i(A) coincides with the mean density Pjv(A) of the 
points {Aj} around the point A on the real axis. Similarly, write the mean square iV^ 

^ = J Xb{X)xb{X')pn{X)pn{X') dXdX' (45) 

and notice that 



Piv(A)pjv(A') = ^ <5(A - Xi)S{X' -Xj) = S{X - A')^ S{X - A^) + ^ 6{X - Xi)6{X' - A,) 

ij i ijij 

= S{X-X')TZi{X) + TZ2{X,X'). (46) 
In this way we arrive at the relation: 

Nl = N^+ f n2{X, X') dXdX' . (47) 

JbxB 

In fact, it is natural to introduce the so-called "number variance" statistics ^2{B) = Ng — [Nb^ 
describing the variance of the number of points of the sequence inside the set B. Obviously, 



■E^iB) =Nb+ [ [7^2(A, A') - 7^l(A)7^2(A')] dXdX' = Nb - [ l2(A, A') dXdX' 

JbxB JbxB 



(48) 



where we introduced the so-called cluster function 5^ (A, A') = 7?.i(A)7?.i(A') — TZ2{X, A') frequently 

used in applications. 

Finally, in principle the knowledge of all n— point correlation functions provides one with the 
possibility of calculating an important characteristic of the spectrum known as the "hole probabil- 
ity" A(L). This quantity is defined as the probability for a random matrix to have no eigenvalues 
in the interval (— L/2,L/2) ^. Define xl(A) to be the characteristic function of this interval. 
Obviously, 

A{L) = / . . . / P„(Ai, . . . , Aiv) n (1 - XL{Xk)) rfAi . . . dA;v (49) 
•' fc=i 

= ^(-1)^' / Pn{Xu...,XN)hj{XL{Xi),...,XL{XN)} dXi . . . dXf, , (50) 

where hj{xi, . . . , Xn} is the j — th symmetric function: 

N 

ho{xi, . . .,xn} = 1, hi{xi, . . .,Xn} = '^Xi, 



^Sometimes one uses instead the interval [—L,L] to define A(L), see e.g. [3]. 
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N 

h2{xi,...,XN} = '^XiXj, hN{xi,...,XN} =XlX2...XN■ 

i<j 

Now, for 1 < j < n 



/ . . . / P„(Ai, . . . , Ajv) n XL(Afc) dXi... dXj 
^ ^ fe=i 

_ (iV - j)! /• /■ XL(Afc)7e,(Ai, . . . , A,) dAi . . . dX^ (51) 
■/ 7ej(Ai,...,Aj) dAi...rfAj. (52) 

J\x^\<L/2 J\xi\<L/2 



{N-j)\ 



( N \ 

As hj contains ( j j terms and as 7?.j(Ai, . . . , Aj) is invariant under permutations of the argu- 
ments, it follows that 

P„(Ai, . . .,XN)hj {xl(Ai) . • ■Xi(AAf)} dXi . ..dXj 

(N-fy. ( N 



) / . . . / 7^j(Al, . . . , Xj) dXi... dXj = (53) 

/ J\xi\<L/2 J\xj\<L/2 



U ...! n 

J- J\xi\<L/2 J\xj\<L/2 



■j{Xi, . . . ,Xj) dXi . . .dXj. (54) 



Thus, wc arrive at the following relation between the hole probability and the n— point correlation 
functions: 

J^(-iy f^/^ f^/^ 

^(^) = EHA/ •••/ 7^i(Al,...,A,•)rfAl...dA,•. (55) 

~n J- J-L/2 J-L/2 



3=0 



One of the main goals of this set of lectures is to develop a method allowing to evaluate all the 
n— point correlation functions of the eigenvalues for any JPDF corresponding to unitary invariant 
ensembles of the form Eq.(15). After that we will concentrate on a particular case of Gaussian 
weight and will investigate the limiting behaviour of the kernel function Kn{X, A') as N oo. But 
even before doing this it is useful to keep in mind for reference purposes the results corresponding 
to completely uncorrelated (a.k.a. Poissonian) spectra. Those are described by a sequence of real 
points Ai, . . . , Ajv, characterized by the fully factorized JPDF: 

P(Ai, A2, . . . , Ajv) = p(Ai) . . . p(A;v). (56) 

The normalization condition requires /^p(A) rfA = 1, and we further assume p{X) to be a smooth 
enough integrable function. Obviously, for this case 

TV' 

7e„(Ai, A2, . . . , A„) = ^^_^^, p(Ai) . . . p{X„). (57) 



In particular, 7?.„(A) = Np{X) which is just the mean density p(X) of points {A^} aroimd the point 
A on the real axis, and TZn{Xi, A2) = N{N — l)p(Ai)p(A2), etc.. From this we easily find for the 
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number of levels in the domain B and for its mean square: 



= N [ p{X) dX, Nl = Nb{N - 1)/N + pV^ 
Jb 



and for the hole probability 



N 



m 



3=0 



j! (N-jy. 





3 


fL/2 


/ pW dX 




1- / p{X)dX 


J-L/2 




J-L/2 



N 



(58) 



(59) 



Finally, let us specify B to be the interval [—L/2, L/2] around the origin, and being interested 
mainly in large N ^ 1 consider the length L comparable with the mean spacing between ncigh- 

r -1 -1 

bouring points in the sequence {Aj} close to the origin, given by A = p(0) — l/[Np{0)]. In 

other words s = L/A = LNp{Q) stays finite when N — > oo. On the other hand, for large enough 
N the fimction p{X) can be considered practically constant through the interval of the length 
L = 0{1/N), and therefore the mean number of points of the sequence {A^} inside the interval 
[—L/2, L/2] will be asymptotically given by N{s) w N Lp{0) = s. Similarly, using Eq.(58) one can 

fL/2 



= (Ar-l)n;2p(A)dA«s. 



easily calculate the "number variance" So (s) = A''.^ i, i,,— N,_l 

l-T'-) L ^ 2' 

In the same approximation the hole probability, Eq.(59), tends asymptotically to A{s) w e"''. Later 
on we shall compare these results with the corresponding behaviour emerging from the random 
matrix calculations. 



4 The method of orthogonal polynomials 

In the heart of the method developed mainly by Dyson, Mehta and Gaudin lies an "integrating-out" 
Lemma [2]. In presenting this material I follow very closely [3], pp. 103-105. 

• Let Jn = Jn(x) = (Jij)i<i.j<n he an n X Ti matrix whose entries depend on a real vector 
X = {xi,X2, . ■ . , Xn) and have the form Jij = f{xi, Xj), where f is some (in general, complex- 
valued) function satisfying for some measure d^{x) the "reproducing kernel" property: 



^ f{x,y)fiy,z)df,{y) = fix,z). (60) 

Then 



I 



detJn{x) dfjb{xn) = [(l — {n— l)]rfei J„_i (61) 



where q = J f{x,x)d^{x), and the matrix J„_i = {Jij)i<i,j<n-i have the same functional 
form as Jn with x replaced by {xi,X2, ■ ■ ■ , Xn-i)- 

Before giving the idea of the proof for an arbitrary n it is instructive to have a look on the 
simplest case n = 2, when 

"^^ " ( /(^2'^l) /(^2'^2))' ^^^^^ detJn = f{xi,Xi)f{X2,X2)-f{xi,X2)f{X2,Xi). 

Integrating the latter expression over X2, and using the "reproducing kernel" property, we imme- 
diately see that the result is indeed just (q — l)f{x, x) = {q — 1) det Ji in full agreement with the 
statement of the Lemma. 
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For general n one should follow essentially the same strategy and expand the determinant as a 
sum over n! permutations P„(cr) = (cti, . . . cr„) of the index set 1, . . . , n as 



j det J„(x)d/i(a;„) = E(-l)^" y fixi^x^^) . . . f{xn,XaJ dn{xn), 



(62) 



where (—1)-^" stands for the sign of permutations. Now, we classify the terms in the sum according 
to the actual value of the index (t„ = k,k = 1,2, ... ,n. Consider first the case it„ = n, when 
effectively only the last factor Xn) in the product is integrated yielding d upon the integration. 

Summing up over the remaining (n — 1)! permutations P„_i(cr) of the index set (1, 2, n — 1) we 
see that: 

^(-1)^" / f{xi,x„^) . . . f{xn,Xn) dii{x„) =q ^ {-l)^''-^ f{xi,XaJ . . . f{x„-i,Xa^_J, 



which is evidently equal to gdet J„_i. Now consider (n — 1)! terms with an = k < n, when we 
have aj = n for some j < n. For every such term we have by the "reproducing property" 



f{xi,Xai) ...f{Xj,Xn) ■ . . f (Xn, Xk) d^J,{Xn) = f{xi,Xai) ■ ..f{Xj,Xk) ■ . . /(a;„_i, J. 

Therefore 



det J„(x)d/i(a;„) =gdet J„_i + E E (~^)^" fixi,Xai) ■ ■ ■ f{xj,Xk) ■ ■ ■ f{xn-i,x„^_^). 

fe=l P„:o-„ = fc) 

(63) 

It is evident that the structure and the number of terms is as required, and the remaining task is 
to show that the summation over all possible (n — 1)! permutation of the index set for fixed k yields 
always — det Jn-i, see [3]. Then the whole expression is indeed equal to [g — (n — 1)] det J„_i as 
required. 

Our next step is to apply this Lemma to calculating the n— point correlation functions of the 
eigenvalues Ai, . . . , A„ starting from the JPDF, Eq.(38). 
For this we notice that 



N 



/ 1 

Ai 



niV(iV-l) 
(Ai-A,) = (-l)^^det 



N-l 



1 \ 

Ajv 



A 



JV-l 



AAr(Ai, . . . , Aat), 



(64) 



where the determinant in the right-hand side is the famous van der Monde determinant. Since 

the determinant cannot change upon linearly combining its rows, the entries A*^ in (fc + 1) — th 
row of the van der Monde determinant can be replaced, up to a constant factor aoai...ajv-i, by 
a polynomial of degree k of the form: Trk{\i) = a^A^ + any polynomial in Aj of degree less than k, 
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with any choice of the coefficients ai, I = 0, . . . , k. Therefore: 



n(A.-A,) = 



N(N-l) 



1<J 



aoai...aN-i 



■det 



/ 7ro(Ai) 
7ri(Ai) 



\ 7rjv_i(Ai) 



7ro(A7v) \ 
7ri(AAr) 



7rjv-i(Ai) / 



(-1) 



iV(iV-l) 



aoai...ajv-i 



det {^T^-l{Xj))l<i,<N■ 



(65) 



Multiplying every entry in jth column in the above determinant with the factor e iQi'^j) we see 
that the JPDF can be conveniently written, up to a multiplicative constant, as 



P(Ai,...,Ajv) oc 



det(e-i«(^^)7ri_i(A,))^ 



<i,j<N 



If we let A be the matrix with the entries Aij = {,'Pi-i{^3))i<i j<N^ then 



(66) 



[det A]^ = det i^i = det ^^AjiAjk 



(67) 



This implies the following form of the JPDF: 



l<i,k<N 



V{Xi, . . . , Ajv) oc det (j)j-i{Xi)(j)j-i{Xk) 
where we introduced the notation: 

AT-l 

ifjv(A,A')= ^</.,(A)<^,(A') 
i=o 

usually called "kernel" in the literature. In our particular case 

0i_i(A) = e-^«(^)7ri_i(A) 
so that the kernel is given explicitly by 

N-l 



= det{KN{Xi,Xk)\^,^k<N (68) 



ifjv(A,A') =e-HQW+Q(^')) ^ 7r,(A)7r,(A'). 



(69) 



(70) 



(71) 



j=0 



Now it is easy to see that if we take the polynomials Tri{x) such that they form an orthonormal 
system with respect to the weight e~'^^^\ the corresponding kernel will be a "reproducing" one 
with respect to the measure d^{x) = dx, in the sense of the "integrating-out" Lemma. Indeed, 
suppose that Tri{x) satisfy the orthonormality conditions: 



J e '^^'^Ki{x)iTj{x)dx = Sij, 



(72) 
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for any indices i > 1, j > 1- Then we obviously have 



j=0 fc=0 

AT-l 

= J2 e-5(Q(^)+Q(-))^^-(a;)^,(z) = Kn{x,z) (73) 

3=0 

exactly as required by the reproducing property. Moreover, in this case obviously 

w-i 

Qn = I KN{x,x)dx I e '^'^''\j{x)'Kj{x) dx = N, 

and therefore the relation (61) amounts to 

j det {KN{xi,Xj))^^^ .^J^ dxN = det {KN{xuXj))^^^ .^^_^ . (74) 

Continuing this process one step further we see 

j J det {KN{xi,Xj))^^^j^j^ dxN-idxN = j det{KN{xi,Xj))^^^j^j^_^dxN-i (75) 
= [N-iN-2)]det{Kr,{xuXj)\^^.^^_^ (76) 

and continuing by induction 

j ■■■ J det(ii'jv(a;i,a;j))j<._^.<jy dxk+i ■ ■ -dxN = {N - ky.det{KN{x^,Xj))^^^J^^. (77) 

for k = 1,2,..., and the result is A''! for k = 0. Remembering the expression of the JPDF, Eq.(68), 
in terms of the kernel K^i(xi,Xj) wc sec that, in fact, the theory developed provided simultane- 
ously the explicit formulae for all n— point correlation functions of the eigenvalues 7^„(Ai, . . . , A„), 
introduced by us earlier, Eq.(41): 

7^„(Al, . . . , A„) = det {Km{\u Aj))i<ij<„ (78) 

expressed, in view of the relations Eq.(71) effectively in terms of the orthogonal polynomials 7rfe(A). 
In particular, remembering the relation between the mean eigenvalue density and the one-point 
function derived by us earlier, we have: 

JV-l 

MA) = Kn{X, A) = E e-'3(^)7rj_i(A)7r,_i(A). (79) 
i=i 

The latter result allows to represent the "connected" (or "cluster" ) part of the two-point correlation 
function introduced by us in Eq.(48) in the form: 



l2(Ai,A2) = pjv(Ai) /9iv(A2)-7^„(Al,A2) = [i^Jv(Al,A2)]^ (80) 
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Finally, combining the relation Eq.(55) between the hole probability A{L) and the n-point 
correlation functions, and on the other hand the expression of the latter in terms of the kernel 
iV'jv(A, A'), see Eq.(78), we arrive at 



A{L) = V ^—^ / . . . / det 

~^ J- J-L/2 J-L/2 



KN{Xi,Xj) \ 



Kn{Xj,Xj) J 



dXi...dXj. (81) 



In fact, the last expression can be written in a very compact form by noticing that it is just 
a Fredholm determinant det (X — /Cat), where ICn is a (finite rank) integral operator with the 
kernel Kn{X,X') = J2i^^ 'Pi{X)4'i{X') acting on square-integrable functions on the interval A e 
(-L/2,L/2). 



5 Properties of Hermite polynomials 

5.1 Orthogonality, Recurrent Relations and Integral Representation 

Consider the set of polynomials hk{x) defined as ^ 



dx^ 

and consider, for > Z 



hu{,x) = (-l)'=e^* — ^ (e-^*) = N^x" + • • • , (82) 



e-'''^hi{x)hk{x)dx = {-if j dxhiix)^ (e-^^j 



(83) 



d-Ki-)i^{e--^)=... = i-ir I dxe--^^h,{x) 



Obviously, ioi k > I wc have ■j^hi{x) — 0, whereas for k — I we have j^hk{x) = klN^. In this 
way we verified the orthogonality relations and the normalization conditions 



/oo 
-oo 



e~^~hi{x)hk{x)dx = Ski (84) 



for normalized polynomials 



hk{x) =: jj^hk{x). (85) 



^The standard reference to the Hermite polynomials uses the definition 



Such a choice ensures H^.{x) to be orthogonal with respect to the weight . Our choice is motivated by random 
matrix applications, and is related to the standard one as hk{x) = Hj^ ^■\/ • 
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In the theory of orthogonal polynomials an important role is played by recurrence relations: 



N[xhk{x) - khk-i{x)] , 



where we exploited the Leibniz formula for the fc— th derivative of a product. After normalization 
we therefore have 



k + 1 



N 



1/2 



hk+i{x) = xhk{x) 



1/2 



hk-i{x). 



(87) 



Let us multiply this relation with hk{y), and then replace x by y. In this way we arrive at two 
relations: 



k + 1 



-,1/2 



N 
k+1 



hk+i{x)hk{y) = xhk{x)hk{y) 



-,1/2 



N 



hk+i{y)hk{x) = yhk{x)hk{y) - 



1/2 



hk-i{x)hk{y), 



1/2 



hk-i{y)hk{x). 



(88) 
(89) 



The difference between the upper and the lower line can be written for any fc = 1, 2, . . . as 



1/2 



{hk-i{y)hk{x) - hk-i{x)hk{y)}. 



{x - y)hk{x)hk{y) = Ak+i - Ak , Ak = 
Summing up these expressions over k: 

n-l 

{x-y)^ hk{x)hk{y) = {A2 + ... + An)-{Ai + ... + An-i) = An - Ai 
fc=i 

and remembering that Ai = ^J~^{x — y) = {x — y)ho{x)ho{y) we arrive at a very important 
relation: 



^ hk{x)hk{y) 



fc=() 



n h n-i{y)hn{x) - hn-i{x)h„{y) 
x-y 



or, for the original (not-normalized) polynomials: 



n-l 



1 1 
E km^f'k{x)hk{y) = -j^^ 



hn-l{y)h„{x) - hn-l{x)hn{y) 



fe=0 



(n- l)!Af" 



x~y 



(90) 



(91) 



which are known as the Christoffel-Darboux formulae. Finally, taking the limit a; — > y in the above 
expression we see that 



n-l 



1 1 

^ fcliV^^^^""^ " (n-l)!iV" [f^n-ii^Ki^) - hn-iix)K{x)] . 



fe=0 



(92) 



Most of the properties and relations discussed above for Hermite polynomials have their ana- 
logues for general class of orthogonal polynomials. Now we are going to discuss another very useful 
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property which is however shared only by few famihes of classical orthogonal polynomials: Her- 
mite, Laguerre, Legendre, Gegenbauer and Jacoby. All these polynomials have one of few integral 
representations which arc frequently exploited when analyzing their properties. For the case of 
Hermite polynomials we can most easily arrive to the corresponding representation by using the 
familiar Gaussian integral identity, cf. Eq.(28): 



/OO 
dge-'^«'+'^«^. (93) 
-OO 



Substituting such an identity to the original definition, Eq.(82), we immediately see that 

hk{x) = {-iNfJ—e'''T / dgg'=e-^«'+'^«^, (94) 
V 27r 

which is the required integral representation, to be mainly used later on when addressing the 
large- A'' asymptotics of the Hermite polynomials. Meanwhile, let us note that differentiating the 

above formula with respect to x one arrives at the useful relation ■j^hk{x) = Nxhk{x) — hk+i [x) = 
Nk hk-i{x). This can be further used to simplify the formula Eq.(92) bringing it to the form 

n-1 ^ ^ 

E fcLATfe''^^^) = (n-2)!Ar»-i l^H^) ' K-i{x)hn+i{x)] . (95) 



fc=0 



5.2 Saddle-point method and Plancherel-Rotach asymptotics of Her- 
mite polynomials 

In our definition, the Hermite polynomials hk{x) depend on two parameters: explicitly on the 
order index fc = 0, 1, . . . and implicitly on the parameter N due to the fact that the weight function 

e~^^ contains this parameter. Invoking the random matrix background for the use of orthogonal 
polynomials, we associate the parameter N with the size of the underlying random matrix. From 
this point of view, the limit iV ^ 1 arises naturally as we arc interested in investigating the 
spectral characteristics of large matrices. A more detailed consideration reveals that, from the 
random matrix point of view, the most interesting task is to extract the asymptotic behaviour of 
the Hermite polynomials with index k large and comparable with A, i.e. k = N + n, where the 
parameter n is considered to be of the order of unity. Such behaviour is known as Plancherel-Rotach 
asymptotics. 

To understand this fact it is enough to invoke the relation (79) expressing the mean eigenvalue 
density in terms of the set of orthogonal polynomials: 

N-1 

MA) = ^Jv(A, A) = e-*^' Yl W' (96) 

3=0 

1 



e 2 



{N - 2y.N^- 



^ [hl{X) - /ijv-i(A)/ijv+i(A)] , (97) 



where we used the expressions pertinent to the Gaussian weight: Q{X) = , 7r^;(A) = hk{X), and 
further exploited the variant of the Christoffel-Darboux formula, Eq.(95). It is therefore evident 

that the limiting shape of the mean eigenvalue density for largo random matrices taken from 
the Gaussian Unitary Ensemble is indeed controlled by the Plancherel-Rotach asymptotics of the 
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Hermite polynomials. In fact, similar considerations exploiting the original Christoffel-Darboux 
formula, Eq.(90), show that our main object of interest -the kernel Kn{X,\') - can be expressed 
as 



Kn{'\ = ^-^(x^+x'^) hN-iWhN{y) - /tAf-i(A)/tAf(A') ^^^^ 

A — A' 



and therefore all the higher correlation functions are controlled by the Plancherel-Rotach asymp- 
totics as well. 

For extracting the required asymptotics we are going to use the integral representation for the 
Hermite polynomials. We start with rewriting the expression Eq.(94) as 

hN+„ix) = i-iNf+'\ — dgg'^+"e-*(«-'^)' (99) 



where 



^-iN)^+n^^ [I^^^{X) + (-l)^+"7;v+n(-a;)] , (100) 

lN+n{x)=J^ dqq-e^f<^'>\ f [q) = \nq - -{q - ixf . (101) 

The latter form is suggestive of exploiting the so-called saddle-point method (also known as the 
method of steepest descent or method of stationary phase) of asymptotic evaluation of integrals of 
the form 

' (j){z)e^^^'Uz, (102) 



where the integration goes along a contour F in the complex plane, F{z) is an analytic fimction 
of z in some domain containing the contour of integration, and iV is a large parameter. The main 
idea of the method can be informally outlined as follows. Suppose that the contour T is such that: 
(i) the value of Re_F has its m,aximum, at a point zq G F, and decreases fast enough when we go 
along F away from zq, and (ii) the value of ImF stays constant along F (to avoid fast oscillations 
of the integrand). Then we can expect the main contribution for N ^ 1 to come from a small 
vicinity of zq = xq + iyo- 




Figure 2: Schematic structure of a harmonic function in the vicinity of a stationary point 2:0- 

Since the function ReF is a harmonic function of a; = Rez, y = Imz, it can have only saddle 
points (see Fig. 2) found from the condition of stationarity F'{zo) = 0. Let us suppose that there 
exists only one such saddle point z — zq, close to which we can expand F(z) w F{zq) + C{z — zq)^, 
where C = ^F"{zo). Consider the level curves [ReF](x, y) — [ReF]{xQ,yo), which are known either 
to go to infinity, or end up at a boundary of the domain of analyticity. In the vicinity of the chosen 
saddle-point the equation for the level curves is Kc[F{z) — F{zo)] — 0, hence 

Re[\C\e''{z-zof] = [{x - xo)\y - yof] cosO - 2{x - xo){y - yo)sm{e) = 0, 
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which describes two orthogonal straight lines passing through the saddle-point 



y = yo + tan ( ^ - ^ j (a; - xq), y = yo - iaxi(^ + ^]{x - xq) 



partitioning 



(TT-e)/2 




-(n/4-8/2) 



-»■ X 



Figure 3: Partitioning of the x — y plane in a vicinity of the stationary point zq into four sectors by 
"level curves" (solid lines). Dashed line shows the bi-sector of the negative sectors: the direction 
of the steepest descent contour. 

"negative" ones KeF[z) < ReF{zo), see Fig. 3 If the "edge points" of the integration contour 
r (denoted zi and Z2) both belong to the same sector, and BjeF{zi) ^ Rei^(2;2), one always can 
deform the contour in such a way that KcF(z) is monotonically increasing along the contour. Then 
obviously the main contribution to the integral comes from the vicinity of the endpoint (of the 
largest value of ReF(^)). Essentially the same situation happens when zi belongs to a negative 
(positive) sector, and Z2 is in a positive (resp., negative) sector. And only if the two endpoints 
belong to two different negative sectors, we can deform the contour in such a way, that IieF{z) has 
its maximum along the contour a.t z = zo, and decays away from this point. Moreover, it is easy 
to understand that the fastest decay away from zo will occur along the bi-sector of the negative 
sectors, i.e. along the line y ~ yo = tan2^(a; — xq). Approximating the integration contour in 

the vicinity of zq as this bi-sector, i.e. hy z = zq + {x — xo)^^^^2)' ^® S^t the leading term of the 
large- A'' asymptotics for the original integral by extending the limits of integration in the variable 
X = X — xo from —00 to 00: 



/ 0(.)e^^(^)d.«<A(.o)e^^(-)^^ 



dxe-''\^\^\mH/2 



27r 



N]F"izo)\ 



e^p{NFizo) + -(tt - Arg[F"{zo)/2)])}. 



(103) 



It is not difficult to make our informal consideration rigorous, and to calculate systematic correc- 
tions to the leading-order result, as well as to consider the case of several isolated saddle-points, 
the case of a saddle-point coinciding with an end of the contour, etc., see [7] for more detail. 

After this long exposition of the method we proceed by applying it to our integral, Eq.(lOl). 
The saddle-point equation and its solution in that case amount to: 



F'(,) = - 



q + ix = 0, q = q± 



^ (^ix ± -\/4 - a;2^ . 
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It is immediately clear that we have essentially three different cases: a) \x\ < 2 (b) > 2 and (c) 

kl = 2. 

1. |a;| < 2. In this case we can introduce x = 2 cos <p, < 4> < n, so that q± = i cos (p ± sin (j), or 
= e^^'-'^^'^^^\ q_ = ej'-'^^'^/^K It is easy to understand that wo arc interested only in 
(see Fig.4) and to calculate that Re/(g+) = 5003(2^). On the other hand Re/(g) — > —00 
when either ^ 00 or ^ 0, so that both endpoints belong to negative sectors. To 
understand whether they belong to the same or different sectors, we consider the values of 
Re/(g) = Ini? — ^{R^ — x"^) along the real axis, q = R-ieal. As a function of the variable R 
this expression has its maximal value Re/(g = 1) = — ^ + 2 cos^ cj) at q = R = 1. 

.Imq 

s.d. contour 
\ 1 *' Req 



Figure 4: Structure of the saddle-points q± and the relevant steepest descent contour for |a;| < 2. 

Noting that Rc/(g = 1) — Rc f{q+) = cos^ > 0, wc conclude that the point q ~ I belongs 
to a positive sector, and therefore the existence of this positive sector makes the endpoints 
q = and q = 00 belonging to two different negative sectors, as required by the saddle-point 
method. Calculating 

/"(g+) = - (^1 + ^) =2isin</,e'^ 

we see that |C| = sincj), = + n/2, and further 

"1 



f"{q+) = I cos i2<l>)+i 



2 sin (2.^) -(j) + Tr/2 



Now we have all the ingredients to enter in Eq.(103), and can find the leading order con- 
tribution to I]\i^n{x). Further using Iff^ni—x) — lN+k{x), valid for real x, we obtain the 
required Plancherel-Rotach asymptotics of the Hermite polynomial: 

hN+nix) « ^"^^"y^J^e^ COS I (n + 1/2)0 - 7r/4 + iV (^<^ - ^ sin 2</.^ | , (104) 

where x = 2 cos (j), < < tt, n <^ N. 
Now we consider the opposite case: 

2. I a; I > 2. It is enough to consider explicitly the case x > 2 and parameterize x = 2 cosh (p, < 
(f) < 00. The saddle points in this case are purely imaginary: 

a± = |(2cosh(?i±2sinh(?i) =16=^"^. (105) 
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One possible contour of the constant phase passing through both points is just the imaginary 
axis q = iy, where Im/(gr) = -jt/2 and Re/(g) = Iny + i(y — x)"^. Simple consideration gives 
that 2/_ = e~'^ corresponds to the maximum, and = e"^ to the minimum of Ilc f{q) along 
such a contour. It is also clear that for q = iy+ the expression Re/((7) has a local maximum 
along the path going through this point in the direction transverse to the imaginary axis. 
The "topography" of Re/(g) in the vicinity of the two saddle-points is sketched in Fig. 5 



q+ 



Req 



Figure 5: The saddle-points q±, the corresponding positive sectors (shaded), and the relevant 
steepest descent contour (bold) for \x\ > 2. 

This discussion suggests a possibility to deform the path of integration F to be a contour 

of constant phase lTaf{q) consisting of two pieces - Ti = {q = iy, < y < and r2 
starting from q = iy^ perpendicular to the imaginary axis and then going towards g = oo. 
Correspondingly, 

lN+n{x>2)= dgg"e^-^(«) = / dgg" e^-^(«) -h / dgg"e^-^(«). (106) 
Jo Jo Jt2 

The second integral is dominated by the vicinity of the saddle-point q = iy+, and its evalu- 
ation by the saddle-point technique gives: 



ir, 2 V -/V sinh (j) 



where the factor i arises due to the saddle-point being simultaneously the end-point of the 
contour. As to the first integral, it is dominated by the vicinity of iy-, and can also be 
evaluated by the saddle-point method. However, it is easy to verify that when calculating 
hN+n{x) oc [/jv+rt(a;) + (— l)-'^+"/iv+n(— a;)] the corresponding contribution is cancelled out. 
As a result, we recover the asymptotic behaviour of Hermite polynomials for a; > 2 to be 
given by: 

i\Tn+N — — 

hN+n{x = 2cOsh(2<^) > 2) = ^(n+i)0-f (sinh(20)-20)_ (^q^^ 

V sinh (f) 

Now we come to the only remaining possibility, 

3. |x| = 2. It is again enough to consider only the case x = 2 explicitly. In fact, this is quite a 

special case, since for x ^ 2 two saddle-points q± degenerate into one: q^ q^ ^ i. Under 
such exceptional circumstances the standard saddle-point method obviously fails. Indeed, 
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the method assumed that different saddle-points do not interfere, which moans the distance 
1^+ — g_| = ^J\A — x^\ is much larger than the typical widths W ^M\f"{ )| regions 
around individual saddle-points which yield the main contribution to the integrand. Simple 



calculation gives \ f"{q±)\ = \1 + \ — \J\A — .T^j, and the criterion of two separate saddle- 
points amounts to |a; — 2| » iV~^/^. We therefore see that in the vicinity of a; = 2 such that 
|x — 2| ~ N~^/^ additional care must be taken when extracting the leading order behaviour 
of the corresponding integral lN+n{x = 2) as iV ^ oo. 

To perform the corresponding calculation, we introduce a new scaling variable ^ = iV^/^(2 — 
x), and consider ^ to be fixed and finite when N ^ oo. We also envisage from the discussion 
above that the main contribution to the integral comes from the domain around the saddle- 
point Qsp = i oi the widths I? — i| ~ ■\/|2 — x\ iV~^/^. The integral we are interested in is 
given by 

Jn{0 = I cigg^+"e-*(«-")' (108) 



t \" N 
e 



7V1/3 



\ (109) 



where we shifted the contour of integration from the real axis to the line q = i + j^-j^ , — oo < 
t < oo to ensure that it passes through the expected saddle-point qsp = i, and also scaled the 
integration variable appropriately. Now we can consider finite when N ^ 1, and expand 
the integrand accordingly. A simple computation yields: 

Jiv»i(0«A^-'/'i'^+"e^/2-^'''« / rfte-^«*+^V. (110) 

J — oo 

Up to a constant factor the integral appearing in this expression is, in fact, a representation 
of a special function known as Airy function Ai{^): 

Ai{^) = - [ dt cos (£,t + %\ (111) 



which is a solution of the second-order linear differential equation ■^F{^) — ^F{^) = 0. A 
typical behaviour of such a solution is shown in Fig. 6. 

All this results in the asymptotic behaviour of the Hermitian polynomials in the so-called 
"scaling vicinity" of the point x = 2: 



I'N+n 



= 2 - ^) - ^"/^-"^'^^ ^ii-O- (112) 

Such scaling vicinity of a; = 2 is what gives room for a transitional regime between the oscil- 
lating asymptotics of the Hermite polynomials for < 2, see Eq.(104), and the exponential 
decay typical for \x\ > 2 as described in Eq.(107). Formula (112) indeed matches Eq.(104) 
as ^ oo and Eq.(107) as ^ — oo. This statement is most easily verified by invoking the 
known asymptotics of the Airy function: 



r'/^7r-i/2cos(-|^^/^ + f), ^^00 



M-0^{ 1 .-f|||3/= cl_^ (113) 



and identifying </> = |^|^/^iV <C 1 in the corresponding expressions. 
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Figure 6: The Airy function Ai{^). 



Now we arc going to apply the derived formulae for extracting the large-N behaviour of the 
mean eigenvalue density and the kernel as described in Eqs.(96) and(98), respectively. In fact, it 
is more conventional in the random matrix literature to use the mean density to be normalized to 
unity, rather than to N. Such a density will have a well-defined large-N limit which we will denote 
as Poo (A). 



6 Scaling regimes for GUE 

6.1 Bulk scaling: Wigner semicircle and Dyson kernel. 

The first case to be considered is the spectral parameter |A| < 2 when we can parameterize A = 
2cos(^, and exploit the Plancherel-Rotach expression (104) for the Hermite polynomials. Further- 
more, denoting a = ^(j)—j+N {4> — ^ sin 2(^) , and using the identity cos^ a— cos {a + (j>) cos (a — 0) = 
sin^ (j) we find that h%{X) - hN-i{X)hN+i{X) w 2A^2W ^^n cos (24,) _ Furthermore, using the Stir- 
ling formula: (A'' — 1)! « y^^A^^e"^ and remembering that smcj) = i-\/4 — we arrive, after 
collecting all factors, to the famous Wigner semicircular law for the mean (normalized) spectral 
density: 

= Poo(A) = ^V4-A2, |A| < 2. (114) 

We see that in the limit of large A'' all A'' eigenvalues of GUE matrices are concentrated in the 
interval [—2,2], and the typical separation of two neighbouring eigenvalues close to an "internal" 
point A G (-2,2) is A = j^^^ = 0{N-^), see Fig. 7. 

Let us now follow the same strategy for obtaining, under the same conditions, the limiting 
expression for the kernel K{X, A') using for this goal formula (98). We have: 

/ijv(A)/ijv-i(A') - hN{X')hN-iW (115) 
1/ sin (j) sin 0' 

where af = ±i(/)- f + A^ (</) - isin2(?!)) , af = ±i(/)' - f + A' (0' - i sin 20'). The next step is to 
introduce ^ = {(p + (j)')/2 and f2 = (0 — (/)')/2, and to consider the parameter f2 to be of the order 



lim 

N—>fx> 
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2ttp„( A) 




Figure 7: Wigner semicircular density. Sketch shows a typical spacing between neighbouring levels 
in the bulk of the spectrum. 

oiO{N^^) when taking the limit. This choice ensures that the distance A — A' = 2 [cose/) — cos 0'] « 
4Q sin w AflTTpoo (A) is of the order of the mean eigenvalue separation A - the typical scale for the 
correlations between the eigenvalues in the bulk of the spectrum- and thus must be reflected in the 
structure of the kernel. To this end we denote fl = w/N, and keep in the expressions for terms 
up to the order 0(1), i.e. writing 2 = + [^iV" ~ f i 2a; sin^ ip] , where /? = (V' — 5 sin 2ip^ . 
With the same precision: 

cosa^" cosa2 ~ cosaj" cosa^ ~ sini/»sin (4a;sin^ ip) w sin ^ sin [7rptx)(A)A''(Ai — A2)] 
and cos 2^1 + cos 2^2 ~ 2 — 1^ substituting all these factors back into Eq.(115), we get 

/iAr(A)/ijv-i(A') - /iAr(A')/iAr_i(A) « 2N^^e^^i^-^) sin [7rpo„(A)iV(A - A')]. (117) 

Now, taking into account the normalization factors in /lAr(A) and /iAr_i(A), sec Eq.(85), using again 
the Stirling formula and invoking Eq.(79) we arrive at the following asymptotic expression for the 
kernel, Eq.(98): 

= i^oo[iVpoo(A)(A-A')], K^{r) = ^—^ (118) 

nr 

where if 00 {f) is the famous Dyson scaling form for the kernel. The formula is valid as long as both 
A and A' are within the range (—2,2), and A — A' = 0{N^^). Such choice of the parameters is 
frequently referred to as the "bulk scaling" limit. 

Having at our disposal the limiting form of both mean eigenvalue density and the two-point 
kernel we can analyse such important statistical characteristics of the spectra as e.g. the "number 
variance", see Eq.(48), for an interval of the length L comparable with the mean spacing close to 
the origin A = [Npoo{0)]~^. Under such a condition we can legitimately employ the scaling form 
Eq.(118) of the kernel when substituting it into formula (80) for the cluster function y2(A, A'). In 
this way we arrive at 

E2(L) = N p^{\)d\ - d\ dX'p^{X)po.{X')Kl [iVpoo(A)(A - A')] 

J-L/2 J-L/2 J-L/2 

.s/2 .s/2 

= s- du du'Kl^[{u-u')]. (119) 

J-s/2 J-s/2 



lim 

N^oo 



KNiX,X) 
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Here we used the fact that with the same precision we can put pooW ~ Poo{^') ~ Poo(O) in the 
above expression, and introduced the natural scahng variables: u = X/ A, u = A'/A as well as 
the scaled length of the interval s = L/A (cf. a similar procedure for Poissonian sequences after 
Eq.(55). To simplify this expression further we introduce tt+ = {u + u')/2, r = u — u' as integration 
variables, and use that, in fact Koo{r) = Koo{\r\). The number variance takes the final form: 



Ms) 



r dr ' , du+Kl{\r\) =s-2 f dr{s-r) 



sm 7rr 
7rr 



(120) 



In fact, we are mainly interested in the large-s behaviour of this expression. To extract it, we use 
the identity: 2 Jg°° dr [ ^'"^'' ] = /q°° dx [^^] = 1, and rewrite the above expression as 



2s /■°° , sin^x 1 r'^'l-cosa; , 

The second integral obviously grows logarithmically with s and dominates at large s. A more 
accurate evaluation gives the asymptotic formula: 

E2(s » 1) = ^ [In 2775 + 7 + 1] + 0{l/s). (122) 

TT 

where 7 = 0.5772... is Euler's constant. This is much slower than the linear growth T^^i^s 1) = s 
typical for uncorrelated (Poissonian) sequence, see Fig. 8. The explanation of the slow growth is 

that the sequence of eigenvalues is, in fact, quite ordered, with quite regular spacings of the order 
of A, and therefore the number of points in the interval does not fluctuate as much as it does for 
uncorrelated sequence. 




Figure 8: The logarithmic growth of the number variance S(s) for large GUE matrices versus 
linear growth for uncorrelated (Poissonian) spectrum. 

As to another important and frequently used statistical characteristic of spectral sequences - 
the "hole probability"- its calculation amounts to investigating the asymptotics of the Fredholm 
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determinant of the kernel -ftTjv^oo) see Eq.(81). This is a very difScult mathematical problem, and 
the most elegant solution uses an advanced mathematical technique known as the Riemann-Hilbert 
method[3]. Let us just quote the result: 

A(s » 1) cx -ije-* ^^ (123) 

This Gaussian decay should be again contrasted with a much slower exponential decay typical 
for uncorrelated sequences as indeed in full correspondence with a "quasiregular" structure of the 
random matrix spectrum. 



6.2 Edge scaling regime and Airy kernel 

As we already know, in the vicinity of the "spectral edge" x = 2 (and its counterpart x = —2) 
the Plancherel-Rotach asymptotics of the Hermitian polynomials changes, and is basically given 
by the Airy function, see Eq.(112). This certainly results in essential modifications of the large— 
behaviour of the mean eigenvalue density and of the two-point kernel as long as |A— 2| ~ N~'^/^. To 
extract the explicit formulae for this so-called "edge scaling" limit one may try the same strategy as 
in the bulk. However, one immediately discovers that simple substitution of Eq.(112) into formula 
(97) for the mean density yields zero. A possible way out may be to calculate the next-to-leading 
order corrections to the asymptotics of h{^{x), but wo will rather follow a slightly different (and 
more direct) route and consider the integral representation for the main combination of interest: 

Vr,{\) = hl{\)-hr,-,{\)hr,+A\)=^^^N^'' H dq, T dg2^^e^I/(«^)+/(«^)I (124) 

J-oo J-oo 9i 

where we exploited Eq.(99,101), and defined, as before, f{q) = Ing — ^ (g — iX)^. To Evaluate this 
integral in the edge scaling limit, we follow a familiar procedure: introduce the scaling variable 
^ = A^^/^(A — 2), shift the contours of integration from the real axis to the lines gi,2 = i + 
-j^YTa ,— 00 < ti^2 < oo, consider ^, ti^2 to be fixed and finite when N ^ oo, and expand the 
integrand accordingly around the saddle-points ti^2 = 0. Simple calculation yields, in complete 
analogy with Eq.(llO), the expression: 

27r 

rftie*«*^+'^ j dt2tle'^*'+''-^ - j rftiiie^«*i+'^ j rfis ^2 e'«*^+'^ | . (125) 

The only essential difference from Eq.(llO) which deserves mentioning is the choice of the 
integration contour F which ensures the existence of all the integrals involved. Obviously, one 
can not simply take F = (—00,00), but a more detailed investigation shows that the correct 
contour must be chosen in such a way as to be asymptotically tangent to the line Avgt = 5^/6 for 
Ret ^ —00, and asymptotically tangent to Argt = n/6 for Ref — > 00, see Fig.9. It is then evident, 
that 



Ai{^) = - f die*«*+'^, -iAi'{^) = - f dtte'^'+''^, -Ai"{^) = - f dtt 
Jr Jv Jr 



and collecting all factors we find the expression of the mean eigenvalue density close to the "spectral 
edge" : 

p{X = 2 +^iV-2/3) cx pe(0 = Az'iO^ - Ai"{^)Az{0. (126) 
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Figure 9: The contour of integration T in the definition of the Airy function. 




Figure 10: Behaviour of the spectral density close to the spectral edge. 



For ^ < the function shows noticeable oscillations , see Fig. 10, with neighbouring maxima 
separated by distance of the order of A, — Aj_i oc Aedge =~ iV~^/^ and reflecting typical positions 
of individual eigenvalues close to the "spectral edge". In contrast, for ^ > the mean density 
decays extremely fast, reflecting the typical absence of the eigenvalues beyond the spectral edge. 

A very similar calculation shows that under the same conditions the kernel Kn{X, A') assumes 
the form: 

A(Ci,?2) = 7 T (127) 

known as the Airy kernel, see [8]. 



7 Orthogonal polynomials versus characteristic polynomials 

Our efforts in studying Hermite polynomials in detail were amply rewarded by the provided pos- 
sibility to arrive at the bulk and edge scaling forms for the matrix kernel in the corresponding 
large-N limits. It is those forms which turn out to be universal, which means independent of the 
particular detail of the random matrix probability distribution, provided size of the corresponding 
matrices is large enoiigh. This is why one can hope that the Dyson kernel would be relevant to 
many applications, including properties of the Riemann (^-function. An important issue for many 
years was to prove the universality for unitary-invariant ensembles which was finally achieved, first 
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in [10]. 

In fact, quite a few basic properties of the Hermite polynomials are shared also by any other set 

of orthogonal polynomials ttj. (x) . Among those worth of particular mentioning is the Christoffel- 
Darboux formula for the combination entering the two-point kernel Eq.(71), (cf. Eq.(91)): 



E" ^ /X / s , 7i"„_i(y)7r„(a;) - 7r„-i(a;)7r„(y) 
7rfc(x)7rfc(y) = 6„ , 
1? — tt 



fe=o 



x-y 



(128) 



where 6„ are some constants. So the problem of the universality of the kernel (and hence, of 
the n-point correlation functions) amounts to finding the appropriate large- scaling limit for the 
right-hand side of Eq.(128) (in the "bulk" of the spectrum, or close to the spectral "edge"). 

The main dissatisfaction is that explicit formulas for orthogonal polynomials (most important, 
an integral representation similar to Eq.(94)) are not available for general weight functions dw{X) = 
g-Q(A)^^ For this reason we have to devise alternative tools of constructing the orthogonal 
polynomials and extracting their asymptotics. Any detailed discussion of the relevant technique 
goes far beyond the modest goals of the present set of lectures. Nevertheless, some hints towards 
the essence of the powerful methods employed for that goal will be given after a digression. 

Namely, I find it instruc;tive to discuss first a question which seems to be quite unrelated,- the 
statistical properties of the characteristic polynomials 



N 

Zn{h) = det (^iiIn -H)= [JIm " A,) 



(129) 



for any Hermitian matrix ensemble with invariant JPDF V{H) oc e'x^{—N'TrQ{Hy] . Such objects 
arc very interesting on their own for many reasons. Moments of characteristic polynomials for 
various types of random matrices were much studied recently, in particular due to an attractive 
possibility to use them, in a very natural way, for characterizing "universal" features of the Riemann 
C-function along the critical line, see the pioneering paper[ll] and the lectures by Jon Keating in 
this volume. The same moments also have various interesting combinatorial interpretations, see 
e.g. [12, 13], and are important in applications to physics, as I will elucidate later on. 

On the other hand, addressing those moments will allow us to arrive at the most natural way 
of constructing polynomials orthogonal with respect to an arbitrary weig ht dw{X) = e-'5(^)dA. To 
understand this, we start with considering the lowest moment, which is just the expectation value 
of the characteristic polynomial: 



/oo ,-oo N N 

dw{Xi) ... dw{XN) J{{Xi - Xjf [](/x - Xi) 
-°° i<j i=i 



(130) 



We first notice that 



N 



( 1 

Ai 



N 



W{.X,-Xj) Will - Xi) (X del 



l<3 



N-1 



1 

Ajv 



\N-1 



1 



Af-l 
.N 



(131) 
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Indeed, the right-hand side is obviously a polynomial of degree N in the variable /x, with roots at 
/i = /Lti, /Lt2, • • • , fiN- Therefore it must be of the form C x Hi^iC/" ~ with prefactor C being a 
function of Ai, A^v- The value of such a prefactor can be easily established by comparing both 
sides as /X ^ oo: the left-hand side behaves as C ji^ , whereas expanding the determinant with 
respect to the last column and using the expression for the van der Monde determinant, Eq.(64), 
we see that the right-hand side grows as /r'^ 'W^<-j{^i ^ ^j)- 

Exploiting Eq.(131) allows us to rewrite the expectation value for the characteristic polynomial 



as 



/ 



E [Zn{ij)] oc dw(Ai) det 

J —OO 2 — 1 



1 

Ai 



V K 



N-1 



1 

A /V 



A 



N-l 
N 



( 



det 



1 

Ai 



1 

Ajv 



\ 



which can be further written down as the standard sum over all permutations Po- 
of the index set (1, 2, N): 



( 



D J —OO „_1 



.A^„-Met 



1 

Ai 



N 



1 

Ajv 



1 



(132) 



(133) 



A^^ / 



where |Pcr| = 0(1) for even (odd) permutations. The symmetry of the remaining determinant with 
respect to permutation of its columns ensures that every term in the sum above yields exactly 
the same contribution, and it is enough to consider only the first term with P„ = {1,2, ...,N), 
and multiply the result with A''!. For such a choice, the product of factors A? . . . X^~^ can be 
"absorbed" in the determinant by multiplying the j— th column of the latter with the factor \j~ , 
for all J = 1, ... , N. This gives 



E[Zm{h)](x. j ]Jdu;(Ai)det 
"'-°°i=i 



1 

Ai 



V A? 



N 



X2 
-^2 



A 



JV-l 
JV 



1 



A 



\2N-2 
2N-1 



Af-l 



N 



N 



(134) 



The integral in the right-hand side is obviously a polynomial of degree A'' in fj,, which we denote 
Dn{ij) and write in the final form as 

r^dw{x)x r^dw{x)x^ 



Dm{ij) = det 



C^dw{X)X^-^ 

V C^dw{x)x^ 



S-^dw{X)X^ 
jr^dw{X)X^+^ 



N 



/-°lrf^(A)A 



ir^dw{X)X^^-' 
jr^dw{X)X^^-^ 



1 



\ 



,,N 



(135) 
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The last form makes evident the foUowing property. Multiply the right-hand side with d'w{iJL)ijP 
and integrate over ^,. By linearity, the factor and the integration can be "absorbed" in the last 
column of the determinant. For ]3 = 0,1,...,7V — 1 this last column will be identical to one of 
preceding columns, making the whole determinant vanishing, so that 



J — ( 



dw{tJ.)tjFDN{tJ.) =0, p = 0,l,---,N -1. (136) 



Moreover, it is easy to satisfy oneself that the polynomial D]y{fi) can be written as Dn{ij) = 



Dj^-iH^ + . . ., where the leading coefficient Djsi-i = Aeiij^ d'w{\)y^^\ is necessarily 

V ' i,j=0 

positive: > 0. The last fact immediately follows from the positivity of the quadratic form: 

G{xi, ...,xn)= / dw(X) J2 -^'^^ = S ^i^i / dw{X)X'+^ 
Finally, notice that 

/oo /-oo 
d'w{fi)Dj^{fj,) = / dw{iJ,)DN{lJ-) [Dn-H-i^ + lower powers] (137) 
-oo J —oo 

/oo 
dw{n)DN{lj)lJi^ = Dn-iDn 
-oo 

where we first exploited Eq.(136) and at the last stage Eq.(135). Combining all these facts together 
we thus proved that the polynomials 7rAr(A) = . ^ D{X) form the orthogonal (and normal- 

y -Djv-i-Di\- 



ized to unity) set with respect to the given measure dw{X). Moreover, our discussion makes it 
immediately clear that the expectation value of the characteristic polynomial Z^in) for any given 
random matrix ensemble is nothing else, but just the corresponding monic orthogonal polynomial: 

E[ZM=^'k\l^), (138) 

whose leading coefficient is unity. Leaving aside the modern random matrix interpretation the 
combination of the right hand sides of the formulas Eq.(138) and Eq.(130) goes back, according to 
[6], to Heine-Borel work of 1878, and as such is completely classical. 

The random matrix interpretation is however quite instructive, since it suggests to consider 
also higher moments of the characteristic polynomials, and even more general objects like the 
correlation functions 

Cfc(Mi,M2,---,A*/c) = E [Zn{hi)Zn{ij^2) ■ ■ ■ ZNifJ^k)] ■ (139) 
Let us start with considering 

^oo POO N N N 

C2(Mi,M2)-/ dw{Xi)... dw{XN)l[{Xi-Xjfl[{fii-Xi)l[{n2-Xi). (140) 

Using the notation Ajv(Ai, . . . , Ajv) for the van der Monde determinant, see Eq.(64), we further 
notice that 

N N 

An+2{Xi, . . . , Xn, Ml, /U2) = Ajv(Ai, . . . , Xn) X (/Xi - fl2) ^(Mi - ^i) I1(M2 - Xi), 

i=l i=l 
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which allows us to rewrite the correlation function as 



C2(m1)M2) = 7 r / TTdw(Ai) Ajv(Ai,...,Ajv)Ajv+2(Ai,...,AAr,/Ui,/U2)- 

Now we replace each entry A^ in both van der Monde determinant factors with the orthogonal 
polynomial TTj{Xi) (cf. eq.(65)), and further expand the first factor as a sum over permutations: 
AAr(Ai, . . . , Xn) cx J2pi~^y^^'^oiXai) ■ ■ ■ T^N-iiXaN)- Further using pcrmutational symmetry of 
the second determinant, we again sec that every term yields after integration the same contribution. 
Up to a proportionality factor we can therefore rewrite the correlation function as 



C2(/Xl,/X2) 



/ JJdw(Ai) 7ro(Ai) . . . 7rAr_i(AAr) 
^ —00 ■_i 



(141) 



/ 



X det 



7ro(Ai) 
7ri(Ai) 



7ro(A2) 
7ri(A2) 



7rjv(Ai) 7rAr(A2) 
\ 7rjv+i(Ai) 7rjv+i(A2) 



7ro(Mi) 
7ri(/Ui) 



7^0(^2) 
7^1(^2) 



TTN+lifJ^l) nN+l{lJ.2) j 



At the next step we absorb the factors 7ro(Ai), . . . , i^ n -\{Xn^ inside the determinant by multiplying 
the first column with 7ro(Ai),..., the N—th column with ttat-i (Aat), and leaving the last two columns 
intact. By linearity, we can also absorb the product of the integrals inside the determinant by 
integrating the first column over Ai,..., and N — th column over A^r. Due to the orthogonality, 
the first N columns of the resulting determinant after integration contain zero components off- 
diagonal, whereas the entries on the main diagonal are equal to the normalization constants Cfe = 
dw{X) 7r^(A), k ~ 0, . . . , N . Therefore, the resulting determinant is easy to calculate and, up 
to a multiplicative constant we arrive to the following simple formula: 



C2(M1,/"2) oc 



1 



■det 



•^^(Mi) 7rAr(/i2) 



(a^i - A*2 

In particular, for the second moment of the characteristic polynomial we have the expression 



(142) 



E[Z'^{li)] = hm 



C2(a*i,M2) oc det 



7rAr(^) 

7r7V+i(/i) 



"AT 



(143) 



This procedure can be very straightforwardly extended to higher order correlation functions [14, 
16] , and higher order moments[15] of the characteristic polynomials. The general structure is always 
the same, and is given in the form of a determinant whose entries are orthogonal polynomials of 
increasing order. 

One more observation deserving mentioning here is that the structure of the two-point corre- 
lation function of characteristic polynomials is identical to that of the Christoffcl-Darboux, which 
is the main building block of the kernel function, Eq.(71). Moreover, comparing the above for- 
mula (143) for the gaussian case with expressions (96,95), one notices a great degree in similarity 
between the structure of mean eigenvalue density and that for the second moment of the charac- 
teristic polynomial. All these similarities are not accidental, and there exists a general relation 
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between the two types of quantities as I proceed to demonstrate on the simplest example. For 
this we recall that the mean eigenvalue density p(A) is just the one-point correlation function, see 
Eq.(42), and according to Eq.(39) and Eq.(38) can be written as 

UiiX) = N J r{X,X2,...,XN) dX2 ... Xn 

oc e-«(^) /■rfA2...A^e-Sr=.Q(^*)n(A-A,)' J] (^^"^i)'- (144) 

'' 1=2 2<i<j<N 

It is immediately evident after simple renumbering (A2, . . . , A;v) ^ (Ai, . . . , X^-i) that the integral 
in the second line allows a clear interpretation as the second moment of the characteristic poly- 
nomial E[Zf^_.^{X)] of a random matrix i?jv-i distributed according to the same joint probability 
density function V (Hn-i) dHN-i, but of reduced size — 1, sec Eq.(130) for comparison. We 
therefore have a general relation between the mean eigenvalue density and the second moment of 
the characteristic polynomial of the reduced-size matrix: 



p{X) oc e-'-^^^> l^det (^Aljv - Hn-i)\ (145) 

which explains the observed similarity. This type of relations, and their natural generalizations 
to higher-order correlation functions hold for general invariant ensembles and were found helpful 
in several applications; e.g. for the so-called "chiral" ensembles (notion of such ensembles is 
shortly discussed in the very end of these notes) in [18], for non-Hermitian matrices with complex 
eigenvalues see examples and further references in [19]); for real symmetric matrices see the recent 
paper [20]. 

Now let us discuss another important class of correlation functions involving characteristic 
polynomials, - namely one combining both positive and negative moments, the simplest example 
being the expectation value of the ratio: 



/Cjv(/U, 2^) = E 



(146) 



For such an object to be well-defined it is necessary to regularize the characteristic polynomial in 

the denominator Zm{v) = det by considering the complex-valued spectral parameter 

V such that Ivav ^ 0. Further generalizations include more than one polynomial in numerator 
and/or denominator. 

Such objects turned out to be indispensable tools in applications of random matrices to physical 

problems. In fact, in all applications a very fundamental role is played by the resolvent matrix 
(/uIat — H)^^, and statistics of its entries is of great interest. In particular, the familiar eigenvalue 
density p{v) can be extracted from the trace of the resolvent as 

p{v) = - lim ImTr ^- -. (147) 

Im/n^o- ii\n — H 

It is easy to understand that one can get access to such an object, and more general correlation 
functions of the traces of the resolvent by using the identity: 



d Zn{ij) I 



Tr^^=-^^l.=.. (148) 
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We conclude that the products of ratios of characteristic polynomials can be used to extract the 
multipoint correlation function of spectral densities (see an example below). Moreover, distribu- 
tions of sonic other interesting quantities as, e.g. individual entries of the resolvent, or statistics 
of eigenvalues as functions of some parameter can be characterized in terms of general correlation 
functions of ratios, see [21] for more details and examples. Thus, that type of the correlation 
function is even more informative than one containing products of only positive moments of the 
characteristic polynomials. 

In fact, it turns out that there exists a general relation between the two types of the correlation 
fimctions, which is discussed in full generality in recent papers [17, 22, 23]. Here we would like to 
illustrate such a relation on the simplest example, Eq.(146). To this end let us use the following 
identity: 

1 ^1^1 

and integrate the ratio of the two characteristic polynomials over the joint probability density of all 
the eigenvalues. When performing integrations, each of N terms in the sum in Eq.(149) produces 
identical contributions, so that we can take one term with fc = 1 and multiply the result by N. 
Representing A^(Ai, . . . , Ajv) = n2<i(^i~'^»)^ Y\2<i<ji^i~ ^j)^ ^ observing some cancellations, 
we have 

JV N 



j dw{M)f^^^J dw{X2)...dw{\N) n {h-Xjftl{M-\i){li-\i) 

2<i<j i=2 

oc J dw{Xi) ^~^^ X det (Ai - ^jv-i) det (/i - ^jv-i). (150) 



The average value of the products of two characteristic polynomials found by us in Eq.(142) can 
now be inserted into the integral entering Eq.(150), and the resulting expression can be again 
written in the form of a 2 x 2 determinant: 

i^jv(/u,z/) oc det i\' f i\ (151) 

where /jv {ly) stands for the so-called Cauchy transform of the orthogonal polynomial 

The emerged functions fN{^) is a rather new feature in Random Matrix Theory. It is instructive 
to have a closer look at their properties for the simplest case of the Gaussian Ensemble, Q(A) = 
NX'^/2. It turns out that for such a case the functions fN{v) arc, in fact, related to the so- 
called generalized Hermite functions Hjv which are second -non-polynomial- solutions of the same 
differential equation which is satisfied by Hermite polynomials themselves. The functions also have 
a convenient integral representations, which can be obtained in the most straightforward way by 
substituting the identity 



1 

oc 



v-X 



^^gitsgn[Imi/](iy-A) 







into the definition (152), replacing the Hermite polynomial with its integral representation, Eq.(94), 
exchanging the order of integrations and performing the A— integral explicitly. Such a procedure 
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results in 

fM+n{i^) oc rdtt.V+ng-^^(#-itsgn[Im.].)^ 
Jo 

Note, that this is precisely the integral (101) whose large-N asymptotics for real v we studied in 
the course of our saddle-point analysis. The results can be immediately extended to complex u, 
and in the "bulk scaling" limit we arrive to the following asymptotics of the correlation function 
(151) close to the origin 

f e"*'^'' if Imiv) > 
Jini^/Cjv(/i,z/) =/Coo[A^Pcx>(0)(/x-«^)], K^{r) (x ■( ^^^^ iflm(i/)>0 " ^^^^^ 

In a similar, although more elaborate way one can calculate an arbitrary correlation function 
containing ratios and products of characteristic polynomials [17, 23]. The detailed analysis shows 
that the kernel S{ii,v) = /C(/x, i/)/(/(i — v) and its scaling form Soa{r) oc ^'^J-^^ play the role of 
a building block for more general correlation functions involving ratios, in the same way as the 
Dyson kernel (118) plays similar role for the n-point correlation functions of eigenvalue densities. 
This is a new type of "kernel function" with structure different from the standard random matrix 
kernel Eq.(69). The third type of such kernels - made from functions fniy) alone - arises when 
considering only negative moments of the characteristic polynomials. 
To give an instructive example of the form emerging consider 



/Cjv(/xi,/i2, 2^1,1^2) = E 



ZnUh) ■Z'7v(M2) 



Zn{vi) Zn{v2) 
xdet 



- IJ-2){vi - V2) 

S{iJLi,vi) S {111,1^2) 
S {112,1^1) S {112,1^2) 

Assuming Im vi > 0, Im 1^2 < 0, both infinitesimal, wc find in the bulk scaling limit such that 
both 7Vp(0)/xi 2 = Ci 2 and Np{0)i'i 2 = K12 are finite the following expression (see e.g. [22], or 
[21]) 

lim ]CN{lJ'l,IJ'2,tyi,V2) =}Coo{Cl,C2,Kl,K2) (155) 
iV— >(x) 



gi7r(Ci-C2) 

C1-C2 



gi7r(«i-«2) ('^l~Cl)('^2-C2) _ (^1 - C2)(k^2 - Cl) 



(156) 

Kl - K2 Kl - K2 _ 

This formula can be further utilized for many goals. For example, it is a useful exercise to 
understand how the scaling limit of the two-point cluster function (80) can be extracted from 
such an expression (hint: the cluster function is related to the correlation function of eigenvalue 
densities by Eq.(45); exploit the relations (147), (148)). 

All these developments, - important and interesting on their own, indirectly prepared the ground 
for discussing the mathematical framework for a proof of universality in the large- A'^ limit. As was 
already mentioned, the main obstacle was the absence of any sensible integral representation for 
general orthogonal polynomials and their Cauchy transforms. The method which circumvents this 
obstacle in the most elegant fashion is based on the possibility to define both orthogonal polynomials 
and their Cauchy transforms in a way proposed by Fokas, Its and Kitaev, see references in [3] , as 
elements of a (matrix valued) solution of the following (Riemann-Hilbert) problem. The latter can 
be introduced as follows. Let the contour S be the real axis orientated from the left to the right. 
The upper half of the complex plane with respect to the contour will be called the positive one 
and the lower half - the negative one. Fix an integer n > and the measure w{z) = e""^^^^ and 
define the Riemann-Hilbert problem as that of finding a 2 x 2 matrix valued function Y = Y^"\z) 
satisfying the following conditions: 
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• r(")(z)-analyticinC\E 

Here Y^\z) denotes the limit of Y^"'\z') as 2;' i— > 2; e S from the positive/negative side of the 
complex plane. It may be proved (see [3]) that the solution of such a problem is unique and is 
given by 

y("Hz) = ( ''"^^^ , , ^"^y^ , A, Im ^ (157) 
^ ^ \^ 7„_i7r„_i(2;) 7„-i/„-i(2;) ) ^ ' 

where the constants 7„ are simply related to the normalization of the corresponding polynomials: 
7„ = -27ri[/^du;7r2]-i. 

On comparing formulae (151) and (157) we observe that the structure of the correlation function 
ICn{P', i^) is very intimately related to the above Ricmann-Hilbcrt problem. In fact, for 11, = v = z 
the matrices involved are identical (even the constant 7ra-i in Eq.(151) emerges when we replace 
oc with exact equality sign). Actually, all three types of kernels can be expressed in terms of the 
solution of the Ricmann-Hilbcrt problem. The original works [3, 24] dealt only with the standard 
kernel built from polynomials alone. From that point of view the presence of Cauchy transforms 
in the Riemann-Hilbert problem might seem to be quite mysterious, and even superfluous. Now, 
after revealing the role and the meaning of more general kernels the pictiirc can be considered 
complete, and the presence of the Cauchy transforms has its logical justification. 

The relation to the Riemann-Hilbert problem is the starting point for a very efficient method 
of extracting the large— TV asymptotics for essentially any potential fimction Q{x) entering the 
probability distribution measure. The corresponding machinery is known as the variant of the 
steepest descent/stationary phase method introduced for Riemann-Hilbert problems by Deift and 
Zhou. It is discussed at length in the book by Deift [3] which can be recommended to the interested 
reader for further details. In this way the universality was verified for all three types of kernels 
pertinent to the random matrix theory not only for bulk of the spectrum[22], but also for the 
spectral edges . In our considerations of the Gaussian Unitary Ensemble we already encountered 
the edge scaling regime where the spectral properties were parameterized by the Airy functions 
Ai{x). Dealing with ratios of characteristic polynomials in such a regime requires second solution 
of the Airy equation denoted by Bi{x), sec [27]. 

We finish our exposition by claiming that there exist other interesting classes of matrix en- 
sembles which attracted a considerable attention recently, see the paper [25] for more detail on the 
classification of random matrices by underlying symmetries. In the present framework we only 
mention one of them - the so-called chiral GUE. The corresponding 2A^ x 2A^ matrices are of the 

form Hch = ( ^ ) , where J of a general complex matrix. They were introduced to provide 

a background for calculating the universal part of the microscopic level density for the Euclidian 
QCD Dirac operator, see [26] and references therein, and also have relevance for applications to 
condensed matter physics. The eigenvalues of such matrices appear in pairs ±Afe , fc = 1, ...,N. 
It is easy to understand that the origin A = plays a specific role in such matrices, and close to 
this point eigenvalue correlations are rather different from those of the GUE, and described by 
the so-called Besscl kernels [28]. An alternative way of looking essentially at the same problem is 
to consider the random matrices of Wishart type W = J, where the role of the special point 



as z 



00 
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is again played by the origin (in such context the origin is frequently referred to as the "hard 
spectral edge", since no eigenvalues are possible beyond that point). The corresponding problems 
for products and ratios of characteristic polynomials were treated in full rigor by Riemann-Hilbert 
technique by Vanlessen[29], and in a less formal way in [27]. 
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